The Bispectrum of IRAS Redshift Catalogs 

Roman Scoccimarro/'^ Hume A. Feldman,'^ J. N. Fry,^ and Joshua A. Frieman^'' 



ABSTRACT 



We compute the bispectrum for the galaxy distribution in the IRAS QDOT, 2 Jy, 
and 1.2 Jy redshift catalogs for wavenumbers 0.05 < k < 0.2/iMpc^^ and compare the 
results with predictions from gravitational instability in perturbation theory. Taking 
into account redshift space distortions, nonlinear evolution, the survey selection func- 
tion, and discreteness and finite volume effects, all three catalogs show evidence for the 
dependence of the bispectrum on configuration shape predicted by gravitational insta- 
bility. Assuming Gaussian initial conditions and local biasing parametrized by linear 
and non-linear bias parameters bi and 621 a likelihood analysis yields I/61 = 1.32]tQ'58, 
1.15lg:|^ and 62/6? = -0.571^;^^, -0.50l[]Ji, for the for the 2 Jy and 1.2 Jy samples, 
respectively. This implies that IRAS galaxies trace dark matter increasingly weakly 
as the density contrast increases, consistent with their being under-represented in clus- 
ters. In a model with non-Gaussian initial conditions, the bispectrum displays an 
amplitude and scale dependence different than that found in the Gaussian case; if IRAS 
galaxies do not have bias 61 > 1 at large scales, non-Gaussian initial conditions 
are ruled out at the 95% confidence level. The IRAS data do not distinguish between 
Lagrangian or Eulerian local bias. 

Subject headings: cosmology: observations — methods: statistical — large-scale struc- 
ture of universe 
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1. Introduction 

Observations suggest that the distribution of matter on cosmological scales is nearly homo- 
geneous, with structure that in our present understanding has grown by gravitational instability 
from small, nearly Gaussian, initial fluctuations. On large scales, departures from homogeneity are 
small, and their evolution can be studied in perturbation theory. To linear order in perturbation 
theory, an initially Gaussian fluctuation distribution remains Gaussian, but at higher orders this is 
no longer true. Details of the departures depend on the nonlinear couplings; thus, non-Gaussianity 
can probe the dynamics of structure formation. One useful non-Gaussian measure is the three-point 
correlation function, or the bispectrum in the Fourier domain, a quantity that vanishes identically 
for a Gaussian distribution but is nonzero and has a characteristic behavior as a function of the 
shape of the triangle defined by its three wavevector arguments if large-scale structure is formed as 
a result of gravitational instability (Fry 1984). When applied to the galaxies, the bispectrum also 
depends on galaxy biasing, i.e. on the relation between the galaxy and mass distributions (Fry & 
Gaztahaga 1993; Fry 1994) and on any non-Gaussianity of the primordial density fluctuations (Fry 
& Scherrer 1994). 

Early measurements of the bispectrum in the Lick galaxy catalog did not find the expected 
gravitational instability signature (Fry & Seldner 1982). In angular catalogs such as Lick, the 
density field is integrated over the radial selection function, which in principle can change the 
dependence on shape, although it appears that in practice the effects of projection on the shape 
dependence of the bispectrum are not large (Fry & Thomas 1999; Buchalter, Kamionkowski, & 
Jaffe 2000). Results on the moderate scales probed by this catalog are also affected by nonlinear 
evolution beyond leading order in perturbation theory, which weakens the configuration dependence 
of the bispectrum in a way that mimics the effects of bias (Scoccimarro et al. 1998). A more 
recent determination of the angular three-point correlation function in the APM survey (Prieman 
&; Gaztanaga 1999), which probes significantly larger scales than Lick, shows an amplitude and 
configuration dependence consistent with gravitational instability from Gaussian initial conditions 
and no bias, though with large errors. 

Recent galaxy redshift surveys have mapped the distribution of galaxies out to large distances. 
The largest distances are obtained for deep pencil beam and slice surveys; however, surveys with 
highly anisotropic one- and two-dimensional geometries have window functions which extend to 
large wavenumber in one or two directions, corresponding to the smallest survey dimension(s). 
As a consequence, nonlinearity on this scale tends to wash out the configuration dependence of 
three-point statistics. Thus, although the three-point correlation function has been measured in 
the Las Campanas Redshift Survey (Jing & Borner 1998), the survey geometry is effectively two- 
dimensional, with redshift slices of thickness about 10/i~^Mpc; this scale is not large enough to 
allow a direct comparison with perturbation theory. Thus, it is very difficult to extract quantitative 
information from this survey about biasing and the nature of the initial conditions. 

To compare with predictions of the configuration dependence of the bispectrum from pertur- 
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bation theory, we wish to observe the bispectrum directly in an analysis that to the extent possible 

is not affected by the complications of projection and nonlinear evolution. In this paper, we report 
measurement of the bispectrum in several galaxy redshift catalogs derived from the IRAS Point 
Source Catalog (Chester, Beichmann, & Conrow 1987). Working with three-dimensional (redshift) 
data avoids the effects of radial projection. Moreover, the large volume and nearly full sky coverage 
of these surveys allow us to make measurements at small wavenumber (large scales), where clus- 
tering is well-described by leading order perturbation theory. Although working in redshift space 
introduces distortions arising from the modulation of position induced by peculiar velocities, these 
can be computed (Hivon et al. 1995; Verde et al. 1998; Scoccimarro, Couchman, &; Frieman 1999). 

In Section 2 we review the leading order nonlinear perturbation theory predictions for the 
bispectrum, including its dependence on configuration shape, the distortions introduced by peculiar 
velocities in redshift space, and the effects of a bias in the galaxy distribution relative to matter. 
In Section 3 we summarize properties of the IRAS-hased redshift catalogs, and we present our 
estimators in Fourier space, using the optimal weighting of Feldman, Kaiser, &; Peacock (1994, 
FKP). In Section 4 we summarize the data analysis method based on comparison with mock 
catalogs developed in the companion paper (Scoccimarro 2000b, Paper I), and in Section 5 we 
present the results of our analysis. Section 6 contains a final discussion. 



2. The Bispectrum in Perturbation Theory 

We compute statistics in the Fourier domain, since it naturally separates clustering on different 
physical scales. The Fourier amplitude 6{k), the transform of the fractional density contrast S{x), 
is defined by 

~S{k) = J d'x5{x)e-'^-^, Six) = J ^5»e*^ (1) 

where the mass density is p{x) = p[l + S{x)]. The first moments of S{k) are the power spectrum 
and the bispectrum, 

{S{ki)S{k2)) = {2TrfSj,{ki + k2)P{ki), (2) 
{S{ki)S{k2)6{k3)) = {27rfSB{ki + k2 + k3)B{ki,k2,k3). (3) 

The "momentum-conserving" Dirac (5-function reflects statistically the underlying homogeneity and 
implies that the bispectrum is defined for configurations of wavenumber s that form closed triangles. 
If the initial fluctuations are Gaussian, then the bispectrum vanishes in linear perturbation theory; 
however, mode-mode couplings when nonlinearities are included generate a non-zero B even for an 
initially Gaussian distribution. 

In gravitational instability with Gaussian initial conditions, the bispectrum at second (leading 
nonzero) order in perturbation theory is (Fry 1984) 



Bi23 = Q12P1P2 + Q13-P1-P3 + Q23P2P3, 



(4) 
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where Pi denotes P{ki), etc., and B123 = B{ki,k2,ks). Here Qij is determined entirely by the 
configuration geometry, 



Qij = {1 + k) + cos % ( 77- + "T" ) + (1 - 't) cos^ 9ij, (5) 



where cos 9ij = fcj • fcj. The factor k is k = | in an Einstein-de Sitter universe, with critical matter 
density S7m = 1, and it never varies much from this value. For an open universe with near 
1, K |i7m^^^^, and for a flat universe with cosmological constant, k ^0,^^^^^ (Bouchet et al. 
1992, 1995); while in both cases k — | as 0. 

It is useful to define the reduced bispectrum, 

^123 

^ P1P2 + P1P3 + P2P3 ^ ' 

For a power-law spectrum, P oc A;", at second order in perturbation theory, the reduced bispectrum 
Q is independent of time and of scale leaving only the dependence on triangle shape. This behavior 
contrasts sharply with that of P and B, which vary strongly with scale. Moreover, the expressions 
above imply that Q is very insensitive to the value of ^Im- The solid curve in Figure 1 shows 
the characteristic dependence of Q on configuration shape, here parameterized by the angle 6*12 
between two sides of ratio ^2/^1 = 5, for a power-law spectrum with spectral index n = — 1.4 and 
for rijji = 1. This ratio of sides minimizes the total range of scales covered as varies from to tt, 
but in fact Q is insensitive to the ratio ^2/^1 for ^ < 7r/2. The characteristic dependence on shape, 
with an enhancement for colinear configurations (9 = or 9 = n), reflects the expected anisotropy 
of gravitational collapse. The dotted curve shows the behavior for fi^^ 0. Other curves show 
redshift space and bias results, which we discuss next. 

In redshift space, where radial distances are assigned from recession velocities by Bubble's 
law, peculiar velocities induce distortions in clustering statistics (Kaiser 1987; for a review, see 
Hamilton 1997). In the plane parallel approximation, where the line of sight is taken as a fixed 
direction z, the density in space S{k) and redshift space Ss{k) are related to linear order by 6s{k) = 
[l-|-/i(r2)/x^] 5{k), where /i = Jl^/^ and = kz/k. Averaging over orientations, the power spectrum 
monopole in redshift space then becomes 

P3(fc)=(l + ^/i + ^/f)p(fe), (7) 

while the bispectrum monopole is of the same form as as in equation (4) above, with (Hivon et al. 
1995) 

/ k- k- 

Qij,s = 00 + ^2 cos^ 9ij + 04 cos'' 9ij + (oi cos 9ij + as cos^ ^ij) ( 7^ + ^ 



+ cos^ 9ij) 



kf + k'^ + 2kikj cos 9, 



(8) 
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where the coefficients are 

a„ = (1 +.) + ?(2 + + 1(11 + .)/f + l/f + A;f + ,;,Q + JLA + ^;f) (9) 

a. = (1 + |(2 + 1(23 + .)/f + |/f-lii/?-„A(l + lA-l/f) (11) 

°' = + + (13) 

2 4 

a^i = a;2 = ^K/i/2; (14) 

K is as given before, and /i and /2 are logaritfimic derivatives of tfie first and second order 
perturbation theory growth factors Di and D2 with respect to the cosmological scale factor, 
/i = c/logDi/dloga, etc. For near 1, /i k, $7^/''', and /2 ~ 2/i. Setting /i = /2 = 
gives ao = 1 + K, oi = 1, a2 = 1 — K, and 03 = 04 = xi = 2:2 = 0, reproducing equation (5), the 
result in real space. 

Figure 1 shows the predictions for P ~ A;" with n = —1.4, a good approximation for the IRAS 

samples over the range of scales wc use. The long-dashed curve shows the perturbation theory result 
for Qs{0) in rcdshift space for Vim = 1; the short-dashed curve Vi^ = 0.3. The behavior depends on 
ri^, but more weakly than the factor /(O) = ^'^^'^ that appears in all peculiar velocities. Between 
n = 1 and ri = 0.3, / varies by a factor of 2, but Q by at most about 15%, while the redshift 
space result for — >^ is the same as in real space. Over the entire range of the characteristic 
behavior is very much the same. 

To translate these results for the mass distribution into predictions for the galaxy distribution, 
we must take into account the effects of bias. In the simplest case, a local deterministic bias model, 
in which the galaxy density is a function of the local mass density, the galaxy number contrast can 
be expressed as a series in the mass density contrast, 

00 , 

^9i^) = Y.i^'i^)- (15) 

fe=0 

To leading order in perturbation theory, the resulting galaxy power spectrum is simply related to 
the mass power spectrum, Pg{k) = h\P{k), where 61 is the linear bias factor. In this model, the 
galaxy bispectrum amplitude is (Fry & Gaztaiiaga 1993; Fry 1994) 

Thus, the effect of 61 > 1 is to reduce the dependence of Q on triangle shape, while 62 introduces 
a constant offset. Since biasing operates in real space, instead of redshift space, equation (16) does 
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not hold for redshift-space quantities; however, it turns out that the deviations from equation (16) 
are small (Scoccimarro, Couchman, & Fricman 1999). The dot-dashed curve in Figure 1 shows the 
effect on Q expected for l/h\ = 1.25 and 62/^*1 = —0.5. 

We also compare our results below with previous determinations of the skewness S-^. One-point 
moments of the galaxy overdensity in a cell of volume V , = { 6g correspond to integrals over 
the p— point correlation functions ^p, 

^p = ^ J^d^xi...d^xp^p. (17) 

These moments also display hierarchical scaling to leading order in nonlinear perturbation theory, 

^p = Spe-\ (18) 

where the parameters Sp depend on the shape of the volume and on the spectral index. For a 
spherical tophat filter and power-law spectrum P ~ fc", the skewness 83 is (Fry & Gaztahaga 1993) 

362 



34 , 

y-(3 + n) 



where the result in brackets corresponds to the skewness in the mass induced by gravity from 
Gaussian initial conditions (Juszkiewicz et al. 1993; Bernardeau 1994). 

The relation of equation (16) assumes the bias is local and deterministic. Locality here means 
that the scales we consider below, k < 0.2/iMpc~^, are assumed to be much larger than the char- 
acteristic scale of galaxy formation, where nonlocal processes other than gravity become important. 
Deterministic bias assumes that the scatter in the relation between Sg and 6 can be neglected, with 
biasing described by the mean relation in equation (15). Deviations from deterministic bias can 
be introduced by adding a random field e, whose variance describes the scatter about the mean, 
to the right side of equation (15). However, at least for models in which the scatter is local, that 
is, uncorrelated at large scales, {e{xi)e{x2)) oc Suixi — X2), the use of equation (15) together 
with bispectrum measurements to constrain bias parameters results in a good recovery of the mean 
biasing relation (Paper I). 

We note that the use of the bispectrum to recover information about galaxy bias does not 
depend exclusively on the validity of equation (16); other biasing schemes may predict other rela- 
tionships between galaxy and dark matter bispectra that can similarly be tested. For example, in 
non-Gaussian models, bias and non-Gaussianity interact in a non-trivial way (Fry & Scherrer 1994). 
We will consider an example of this class of models below when we discuss initial conditions. 
Alternatively, one can consider models in which the bias is local in Lagrangian space (e.g. Mo & 
White (1996)) rather than Eulerian space as assumed in equation (15). In this case, the conditions 
for galaxy formation depend on the linearly extrapolated density field [and thus equation (15) holds 
for the initial field configurations which are then evolved by gravity], and the prediction for the 
reduced bispectrum in equation (16) changes to (Catelan et al. 1998) 

1^ ^^2^/^^l-l^ ^Ql2PiP2 + AQ23P2P3 + AQ31P3P1 

Q, = yQus + ^ + {^-^) P1P2 + P2P3 + P3P1 ' 
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where AQjj = {ki/kj + kj / ki) cos 9ij . The additional source of shape dependence in the third 
term can then be tested against observations. It is interesting to note that the difference between 
Eulerian and Lagrangian bias docs not show up in large-scale measurements of the power spectrum; 
to leading (linear) order, both schemes agree. It is the sensitivity of the bispectrum to non-linear 
effects that makes possible to distinguish between these schemes. 



We apply our analysis to three IRAS redshift subsamples: the updated QDOT survey (Ef- 
stathiou et al. 1990) and the 2 Jy (Strauss et al. 1992) and 1.2 Jy (Fisher et al. 1995) redshift 
surveys. The QDOT survey chooses at random one in six galaxies from the IRAS point source 
catalog with a 60 /um flux /go > 0.6 Jy. In this sample there are 1824 galaxies with galactic latitude 
|6| > 10°, with redshifts that correspond to distances 20h^^y[pc < R < 500/i^^Mpc. We have 
used a revised version of the QDOT database in which a redshift error that afflicted approximately 
200 southern galaxies has been corrected, and we have converted all redshifts to the local group 
frame. The other two samples are shallower but denser than the QDOT catalog. The 2 Jy catalog, 
complete to a flux limit /go > 2 Jy, contains 2072 galaxies; the 1.2 Jy catalog, with /go > 1.2 Jy, 
contains 4545 galaxies. Although all three catalogs are large-angle surveys, they are not full-sky: 
because of obscuration due to the Galactic plane, the existence of some bright sources, and other 
technical details, these surveys cover a solid angle of 9.282 sr, about three quarters of the sky. In a 
subsequent paper (Paper III) we will analyze the IRAS PSCz survey (Saunders et.al. 2000). 

To deal with the angular mask and the redshift distribution, we follow the formalism developed 
by FKP, computing moments of a weighted transform of the difference between the data and a 
synthetic random Poisson catalog with the same radial distribution and angular mask. 



where ng{x) = d^ix — Xi) is the point distribution for the galaxy data and ns{x) similarly for 
the synthetic catalog. The factor a = Ng/Ng scales the synthetic catalog to the same density as the 
data, and 'w{x) = l/[l-|-n(£c)Po] is the "optimal" weight function for computing the power spectrum 
(and bispectrum as well, see Paper I); for detailed analysis of the formalism we follow, see FKP. The 
subtraction removes a contribution to the power from the shape of the radial selection function. 
The Fourier transform is performed precisely, summed over objects at their exact positions, rather 
than using interpolation and a fast Fourier transform, for frequencies that are uniformly spaced 
on a grid, k = (rix, ny,nz)kf, where kf = 0.005 /iMpc ^. The results in this paper are obtained 
by setting the fiducial power in the weight function to Pq = 8000, 2000, and 2000 (/i~^Mpc)^ for 
QDOT, 2 Jy, and 1.2 Jy respectively. These values of Pq reflect the different effective depths of the 
samples; in practice, the inferred value of Q is not very sensitive to this parameter. 

Including contributions from discreteness, the second moment of F is related to the power 



3. Catalogs and Optimally Weighted Fourier Transform 




(21) 
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spectrum of the galaxy distribution as 

= / ^\Gik')\'P{\k-k'\) + {l + a) J d^xw'n, (22) 

where the window function G{k) is 

G(fe) = J d^xw{x)n{x)e-'^-'' , (23) 

and n{x) = {ng{x)) is the mean space density of galaxies, given the angular and luminosity 
selection criteria. As noted above, for anisotropic survey geometries the window function G{k) is 
broad in at least one dimension; as a result, a range of scales contribute to ( \F{k)\^). For the 
IRAS catalogs, however, G{k) is a rapidly falling function of |fe|; in this case, for k not too small, 
equation (22) becomes 

(|F(fe)|2) =P(fe) J d^xw^n'^ + {l + a) J d^xw'^n . (24) 
Similarly, the three-point function of F becomes 

( F1F2F3 ) = S123 J d?x w^n^ + (Pi + P2 + P3) J d^x w^fi^ + (1 - a^) J d^x w^n. (25) 



Subtracting the discreteness contributions, our estimators for the power spectrum and bispeo 
trum are 

{\m\') ^ Jd^xw'n 

J d^xw'^n^ J d^xw'^n 



m = ^'i£M,-(^+o.)i^^^., (26) 



(FiFoF^) , - - -^^fd^xw^n? , 9, fd^xw^n , , 



J d^xvfin^ J d^xw^n^ J d^ 



x WW 



and the reduced bispectrum estimator is 



Q = (28) 

P1P2 + P1P3 + P2P3 



4. Data Analysis: Mock Catalogs 

Having obtained the bispectrum amplitude Q following the steps in the previous section, we 
are faced with interpreting the results and comparing to theoretical predictions such as those in 
Section 2. To do this, we must take into account a number of effects present in real redshift surveys 
that can potentially modify these predictions. In this section we summarize results from Paper I, 
where these effects are discussed in detail using mock catalogs constructed from perturbation theory 
and AT-body simulations. We refer the reader to Paper I for a more in-depth analysis. 
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4.1. Nonperturbative Redshift Distortions 

There are two ways in which the perturbation theory calculations of redshift space distortions 
described in Section 2 potentially need to be improved, namely, by going beyond both the plane- 
parallel and perturbative redshift mapping approximations. In the plane-parallel approximation 
assumed above, the structures being observed are considered to be sufficiently far away that the 
line of sight can be considered a fixed direction. Since the IRAS surveys cover most of the sky, 
this is not necessarily a good approximation. In this paper, we only consider the angle-averaged 
(monopole) power spectrum and bispectrum, which are very insensitive to the radial nature of the 
distortions (see Figs. 2 and 3 in Paper I). In our likelihood analysis, we nevertheless use mock 
catalogs in which the redshift-space mapping is correctly performed radially. 

By working at large scales, k < 0.2 /iMpc^^, we can apply leading order nonlinear perturbation 
theory to describe the shape depence of the bispectrum in the absence of redshift distortions 
(Scoccimarro et al. 1998). However, in redshift space, another effect beyond nonlinear dynamics 
enters: the mapping from real to redshift space is itself is nonlinear. The discussion of redshift 
distortions in Section 2 assumed a perturbative expansion for this mapping as well as the dynamics. 
The validity of this expansion is in fact rather restricted (Scoccimarro, Couchman, & Frieman 
1999); over the range of scales we consider, nonlinear corrections to the redshift-space mapping 
are non-negligible (see Fig. 3 in Paper I). We take these effects into account by using second order 
Lagrangian perturbation theory (2LPT) numerical realizations in which the redshift-space mapping 
is done exactly. 

4.2. Finite Volume Effects 

The finite volume of a survey can alter the amplitude and scale or shape dependence of the 
bispectrum, potential systematic effects that we must consider before we can directly compare 
theory with observations. [For a general discussion of the same issues in one-point statistics, see 
Szapudi, Colombi, &; Bernardeau (1999) and references therein.] Since we work at scales small 
compared to the extent of the survey, the integral constraint bias is negligible, thus there are 
basically two manifestations of finite volume effects: estimator bias and statistical bias. 

Estimator bias, the fact that the estimator of the reduced bispectrum in equation (28) does 
not obey (Q) = Q, has two significant sources. Although the power spectrum and bispectrum 
estimators are themselves unbiased, Q is a non-linear combination of them, thus an estimation 
bias arises (Hui & Gaztaiiaga 1999; Szapudi, Colombi, & Bernardeau 1999). In Paper I, we found 
that taking into account the estimator bias is important to meaningfully compare the bispectrum 
obtained in different surveys. However, this is not the full story. One can consider these estimators 
as random fields, each of which can be described by a probability distribution function (PDF) 
and its moments. Normally, one requires an estimator to be unbiased (a condition on its first 
moment) and of minimum variance (a requirement on the second moment). However, the higher 
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order moments of estimators can also play a role. For example, if the skewness of the estimator 
PDF is positive (negative), even if the estimator is unbiased, in a given realization the most likely 
value of the estimator will be systematically lower (higher) than the mean. We call this statistical 
bias. Since in observations we deal with a single sample, this effect must be taken into account 
(Szapudi, & Colombi 1996). 

We must also account for correlations between estimators. In an infinite universe, Fourier 
modes are statistically independent. For a survey of finite volume, translation invariance is broken, 
leading to correlations between different Fourier modes at large scales. We confine our analysis to 
sufficiently small length scales, k > 0.05/iMpc~^, that the window function of the survey can be 
approximated by a delta function (see Paper I for a detailed discussion). However, Fourier modes 
will be correlated with their neighbors in fc-space over the inverse width of the selection function 
of the survey. In addition, correlations arising from shot noise dominate at small scales. 

A proper analysis of these surveys reqires wc take these effects into consideration. In Paper I we 
have studied this problem by using mock catalogs drawn from numerical realizations of perturbation 
theory as well as iV-body simulations. In order to measure the correlation matrix of the bispectrum 
and the PDF of its estimator, many (> 400) realizations of the survey under consideration are 
necessary. This is efficiently done using a numerical implementation of second order Lagrangian 
perturbation theory (2LPT), which is orders of magnitude faster than iV-body simulations and 
reproduces the correct bispectrum at large scales, including nonlinear aspects of redshift distortions 
mentioned above. In order to investigate the domain of validity of 2LPT, we have run N-body 
simulations which confirm that, over the range of scales we consider (0.05 < k < 0.2/iMpc^^), 
2LPT is a very good approximation to the full nonlinear theory. 



4.3. Likelihood Analysis 



The survey geometry and radial selection function introduce biases and correlations that are 
best dealt with in a likelihood analysis. We proceed in the following way: Given a set of measured 
reduced bispectrum amplitudes {Qm}, m = 1, . . . , Nt, where Nt is the number of closed triangles 
in the survey, we diagonalize their covariance matrix so that the Q-eigenmodes g„. 



Nt 



Qn 



m=l 



Qm Qr, 



(29) 



satisfy 



{qnqm) = K^nm, (30) 

where Q = {Q) and (AQ)^ = ( ) —Q^. These Q-eigenmodes have "signal to noise" ratio 



1 

An 



Nt 

^ ^ 7mn 
m=l 



Qr. 



(31) 
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The physical interpretation of the Q-eigenmodes becomes clear when ordered in terms of their 
signal to noise. The (n = 1) eigenmode with highest signal to noise, S/N ~ 3, corresponds to all 
weights 7„(i > 0; it represents the overall amplitude of the bispectrum. The n = 2 Q-eigenmode, 
with S/N 1, has 7^2 > for nearly flat triangles and 7^2 < for nearly equilateral triangles — 
it represents the configuration dependence of the bispectrum. Higher order (n > 2) eigenmodes 
contain further information, such as variations of the amplitude and shape with scale; although 
they generally have S/N < 1, there are many of them, so they cannot be neglected. 

If the joint PDF of the Q m were Gaussian, then diagonalizing the covariance matrix would 
guarantee that the new eigenmodes are statistically independent — the full PDF would be the prod- 
uct of their individual PDF's. Although this is not necessarily so in the general non-Gaussian case, 
we will assume that this independence is a good approximation here (supported by the results of 
Paper I). We can then write down the likelihood as a function of the parameters, 

Nt 

>C(q;i, 0:2) oc JJ Pj[i^i(ai, "2)], (32) 

i=l 

where ai = 1/bi, a2 = h2/h\, and 

( N 1 Qi - (aiQj +"2) .„„x 
z/i (ai, a2) = — 2^ iji -T-^ . (33) 

Here the Qj (j = 1,...,A^t) are the data, and the mean Qj, standard deviation AQj, and the 

non-Gaussian PDF's Pi(fi) are extracted from the mock catalogs. As described in Section 2, the 

reduced bispectrum is mostly sensitive to the power spectrum shape, with a very small dependence 

on through redshift distortions and negligible dependence on the normalization g% at large 

scales. Our mock catalogs arc based on the ACDM model, with $7„j = 0.3, Q^tv = 0-7, and cjg = 0.7; 

in redshift space, for this model erg = 0.84, in agreement with the observed normalization for IRAS 

galaxies (Fisher et al. 1994). The power spectrum shape of this model is also in good agreement 

with the observed IRAS galaxy power spectrum over the range of scales we consider (Feldman, 

Kaiser, &; Peacock 1994; Fisher et al. 1993). If we change the power spectrum shape, the resulting 

bias parameters do change, as the prediction for Q is sensitive to spectral index; for example, for 

a shape parameter T = 0.5, as in the SCDM model, Q is related to that in the ACDM model 

(r = 0.21) by approximately Q{T = 0.21) ^ 1.3Q{T = 0.5) - 0.05 (Fig. 5, Paper I), so that 
fjSCDM ^ 5ACDM/i_3 ^ (^^^/f^2^ACDM _ Q^Q^/ifCDM . ^^^^^ ^-^^ inferred bias 

parameters for IRAS galaxies would be smaller if we assumed SCDM rather than the ACDM 
model. To assess the sensitivity of our results to a change in we also consider mock catalogs 
for a model with 57^ = 1 but with the same power spectrum normalization and shape as the ACDM 
model above. We do not consider mock catalogs with different values of cjg, since the dependence 
of Q on this parameter is very weak (see Paper I). The main effect of a change in as would be 
to decrease (increase) the error bars we infer if IRAS galaxies have linear bias parameter 61 < 1 
(61 > 1), due to shot noise reduction (increase). In this regard, the error bars we derive on the bias 
parameters can be considered conservative. 
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Important results obtained from results of mock catalogs as described in Paper I include 

• Estimator bias can be as large as a factor of two. However, equation (16) continues to hold 
for biased estimates. 

• The PDF of the bispectrum estimator is generally skewed and has non-Gaussian exponential 
tails or even power-law tails in the case of sparse sampling (QDOT) and initial conditions. 

• The cross-correlation coefficient induced by the window function of the survey and shot-noise 
between different power spectrum bins or different triangles is not significantly less than unity. 

A maximum likelihood method to constrain bias parameters from redshift surveys has been 
proposed before by Matarrese, Verde, & Heavens (1997). However, their method does not take into 
account these effects and thus cannot be applied as it stands to IRAS surveys. 

5. Results 

The bispectrum is defined for closed triangles in Fourier space. After averaging over orien- 
tations, we can characterize a given triangle by the lengths of its three sides, ki, k2, ks, where 
we order ki > k2 > k-^, or by any combination of parameters that completely defines a triangle. 
Figures 2, 3, and 4 show the scale dependence of Q{ki) for the QDOT, 2 Jy, and 1.2 Jy surveys, for 
triangles in which the two shorter sides /c2 and k^ are separated by angle 623, i.e., ^2 • ^3 = cos ^23- 
The four panels show results for four ranges of the angle ^23; small ^23 corresponds to nearly flat 
triangles, i.e., all three sides almost colinear, while triangles with larger ^23 are more open. The 
results at very small k are sensitive to the subtraction of power in the selection function, so we have 
constrained the smallest side of the triangle to have ks > 0.05 ^Mpc~^. The scatter in the values 
of Q is not small, but it is still apparent that there is a nonzero positive signal. Neighboring points 
are clearly correlated; for this reason we have not included error bars that describe the dispersion 
of values that enter into each point. A full treatment of triangle correlations, as described in § 4, is 
carried out in the analysis below. Dashed lines show the median and 68% width for the points in 
each panel. The average value of Q in each window generally decreases as ^23 increases, as expected 
from gravitational instability, even for QDOT, where the scatter is largest. 

Figures 5, 6, and 7 show the shape dependence Q{6) for triangles with sides with two sides of 
ratio 0.4-0.6 separated by angle 9. The solid curves show the tree- level (leading order) prediction 
from gravitational instability in perturbation theory with a spectral index n = —1.4 (as in FKP), 
including perturbative redshift distortions (eq. [8]). The long-dashed curves show Q averaged over 
a large number of synthetic 2LPT catalogs, and includes nonperturbative effects. Different symbols 
show results for ki = 0.1-0.125 ^Mpc~^ (triangles), ki = 0.125-0.15 ^Mpc~^ (squares), ki = 0.15- 
0.175 /iMpc~^ (diamonds), ki = 0.175-0.2 /iMpc"-*^ (circles). The signal is weak in the sparse and 
noisy QDOT survey, but in both the 2 Jy and 1.2 Jy surveys a shape dependence is apparent. 
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although there is significant dispersion, especially at nearly colinear configurations. For 2 Jy and 
1.2 Jy, open symbols show the direct result, and filled symbols arc corrected for the average finite 
volume bias. The short-dashed curves show the local bias model of equation (16) using the results of 
the likelihood analysis below, including non-linear redshift distortions, estimator bias, correlations 
between triangles, and the non-Gaussianity of the bispectrum PDF, applied to the 2LPT result. 
While the finite sample correction removes the bias in the mean, the distribution is still skewed to 
low values, and so the filled symbols cannot be directly compared to theoretical predictions. It is 
visually apparent that the skewness is larger in the 2 Jy catalog, consistent with the findings in 
Paper I. 

To assess to what extent the data follow the shape dependence predicted by gravitational 
instability and to extract quantitative bias parameters, we have carried out the likelihood procedure 
described in Section 4. There are potentially 4741 closed triangles binned with integer sides ki/kf = 
10, . . . , 40 (although the galaxy overdensity is not mapped into a grid, we use wavevectors integer 
multiples of kf = 0.005 /iMpc~^). Since the window function of the survey has a width of order 
Ak/kf ~ 5, the number of "independent" triangles is much smaller than 4741. We therefore use 
coarser bins, defined in terms of shape s, ratio r, and scale k parameters (Peebles 1980), 



where the "shape" parameter s obeys < s < 1 (ki > ^2 > ^3), the ratio 1 < r < 4, and the 
overall scale of the triangle satisfies 10 < k/kj < 40. The shape parameter is s = for isosceles 
triangles (and if r = 1 these are equilateral triangles); for s = 1 we have colinear triangles. Using 
10 bins in each variable plus the closed triangle constraint yields 203 triangles. We find that using 
a finer binning leads to a more unstable behavior of numerical computation of the Q eigcnvahies 
and eigenvectors, while making the binning too coarse loses information on the shape and scale 
dependence of the bispectrum. 

Figure 8 shows the results of the likelihood analysis for the 2 Jy and 1.2 Jy catalogs. Due to 
the sparse sampling (and thus large shot noise errors), the bias parameters cannot be significantly 
constrained from the QDOT survey, so we do not show results for it. The bottom panel in Figure 8 
shows 68% and 90% likelihood contours as a function of I/61 and 62/61, assuming Gaussian initial 
conditions and matter density = 0.3. The maximum likelihood analysis yields the results 
in Table 1, with error bars obtained from the 68% marginalized likelihoods shown in the upper 
panels of Figure 8. Despite the apparently different amplitudes of Q in Figures 6 and 7, the 
bias parameters obtained for the two catalogs are remarkably similar. The differences between 
Figures 6 and 7 are largely due to different finite volume effects in the two surveys, which trace to 
their different selection functions. In particular, the shallower 2 Jy catalog is more strongly affected 
by finite volume corrections and skewness of the bispectrum PDF than the 1.2 Jy survey. 

Figure 9 shows the effect of changing the matter density from $7^ = 0.3 to Qm = 1 in the case 
of the 1.2Jy catalog (similar results hold for the 2 Jy catalog). As expected from the analytic results 
described in Section 3 and illustrated in Figure 1, the results given in Table 1 change by about 10% 



s = 



ki — k2 




k = ks, 



(34) 
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in the direction consistent with Figure 1: for = 1 the expected configuration dependence is 
shghtly stronger, and thus the bias must be shghtly larger to account for the same measured value. 
Note however that the 0,m dependence is not nearly as strong as in redshift-space power spectrum 
measurements which determine the combination /3 ~ Q^'^/bi; to retain the same value of /?, our 
result would have to come down to 1/bi = 0.56 for 0^ = 1. Thus, to a large extent, measurements 
of the bispectrum can be used to lift the degeneracy present in the power spectrum. 

We found it useful to track how the results change as we vary the number of eigenmodes kept 
in the analysis. For the two highest S/N eigenmodes, which represent the overall amplitude and 
configuration dependence of the bispectrum, the maximum likelihood values for the bias parameters 
are very similar to those in Table 1, but with larger error bars. As we increase the number of 
eigenmodes from 2 to 203, the likelihood contours slowly contract (these additional eigenmodes 
all have signal to noise less than unity in our fiducial model); the maximum likelihood bias values 
for the 2 Jy catalog hardly change, whereas in the 1.2 Jy survey the maximum likelihood slowly 
oscillates in the linear bias direction within the range l/6i = 1.0-1.3. 

Since these additional eigenmodes carry information on the scale dependence of the reduced 
bispectrum, they can help to constrain non-Gaussian initial conditions, as exemplified by the 
model (Linde &; Mukhanov 1997; Antoniadis et al. 1997; Peebles 1997, 1999a,b), in which the linear 
density field is the square of a Gaussian field. As a result, the initial bispectrum is non-zero and has 

3 /2 

a different scaling, Bi P- , and shape dependence from the gravitionally induced bispectrum in 
equation (4). However, in order to compare with observations, linear extrapolation of the initial 
conditions is insufficient, except at extremely large scales or high redshift, which are not probed by 
the IRAS surveys. Taking into account gravitational corrections, one finds that the overall scale- 
independence of Q is partially restored by gravity over the scales we probe; however, the amplitude 
of Q is roughly two times larger than in the Gaussian case. The configuration dependence resembles 
that of the Gaussian case, but with residual dependences on the ratios ki/kj, which would show up 
as a rather strong dependence on the variable r in equation (34) (Scoccimarro 2000a). In order to 
test this model against observations, we must take into account the effects of biasing as well, which 
are non-trivial for general non-Gaussian models. For the particular case of initial conditions, 
however, a simple result follows, 

1 b^^ 

Qg = yQ + ^, (35) 

where b"^ ~ {3 — 8 / \'^Tr)b2 ~ 1.53 62 (Scoccimarro 2000a). As we did in the Gaussian case, we shall 
assume that the same relation approximately holds in redshift-space as well. We have run 2LPT 
realizations and created mock 1.2 Jy catalogs for initial conditions, as described in Paper I. We 
apply the same likelihood analysis as in the Gaussian case, with appropriate change for the mean, 
variance, and PDF of the bispectra in equation (32). The results of this analysis are shown in 
Figure 10. If we consider only the two highest S/N eigenmodes, which probe the overall amplitude 
and configuration dependence of the bispectrum, we obtain the 68% contour shown in the bottom 
panel, which gives 1/bi = 1.42lQ4g and 62/6^ = —1.40 ± 0.58, consistent with the fact that the 
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overall amplitude of Q is larger than in the Gaussian case, and the configuration dependence is 
about the same. However, including the remaining eigenmodes, which are sensitive to r and k 
dependence, the maximum likelihood is driven to 1/hi = 0, in an attempt to cancel the r and 
k dependence in the model not seen in the data. If we consider that IRAS galaxies are not 
biased at large scales > 1), then the 1.2 Jy bispectrum results rule out initial conditions 

at the 95% confidence level. We note that this model may be considered 'strongly' non-Gaussian; 
models with substantially 'weaker' initial non-Gaussianity (e.g., with negligibly small initial three- 
and four-point correlations) are not excluded by this analysis. 

We can also use this data to constrain the Lagrangian local bias model. This model also 
introduces additional dependence of the bispectrum on r, through the last term in equation (20), 
although with a much smaller amplitude than in the model. We can therefore try to test 
whether the local Lagrangian or the Eulerian model is preferred. We have repeated the likelihood 
analysis using equation (20) in place of equation (16) to incorporate the change in the mean of Q 
as a function of biasing parameters. We find that the goodness of fit, given by the value of the 
likelihood function at its maximum, docs not change from one model to the other by more than 
50%. The largest change is obtained for the 2 Jy survey, shown in Figure 11 , where we compare 
the Eulerian result (same as in Figure 8) to the Lagrangian case (for the ACDM parameters). A 
similar behavior as in the model is observed: the contours shift towards higher bias, as listed 
in Table 1. These values of the bias parameters are consistent with those obtained in the Eulerian 
model. A larger dataset is required to distinguish between these two biasing schemes. 



6. Conclusions 

We have presented results for the reduced bispectrum Q of three IRAS redshift catalogs, com- 
puted using the optimal weighting scheme of FKP, after subtracting contributions to the power 
from discreteness and from the shape of the selection function. That Q is nonzero, positive, and 
generally of the shape predicted by gravitational instability is unmistakable. The ability to extract 
values for bias parameters is marginal; however, after taking into account non-linear redshift dis- 
tortions and finite volume effects, likelihood analysis yields reasonable values for the parameters 
of a local Eulerian bias model, 1/hi = 1.32l°|^, 1.15tg|^ and hi/hi = -0.571^;^^, -0.50tg|}, for 
2 Jy and 1.2 Jy respectively. This implies that IRAS galaxies trace dark matter with decreasing 
effective b function of the mass density fluctuation amplitude, consistent with their being 

under-represented in the cores of clusters. We have also considered constraints on non-Gaussian 
initial conditions, using as an example the x^ model, in which the bispectrum develops a different 
amplitude and scale dependence than in the Gaussian case. If IRAS galaxies are not positively 
biased at large scales (i.e., if b{^^^ < 1), then x^ initial conditions are ruled out at the 95% 
confidence level. 

The likelihood contours show that one combination of the bias parameters is known to better 
accuracy: the long axis of the likelihood contours in Figures 8-11 follows tracks of roughly constant 
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skewness, ^3 = 3.25/61+862/^^ for n = —1.4. Our values inferred for S3 are consistent with previous 
work on the skewness of IRAS galaxies. For the 1.2 Jy sample, Bouchet et al. (1993) obtained 
^3 = 1.5 lb 0.5; for the same catalog, Kim &: Strauss (1988), who include a finite volume correction 
but use a different estimator, obtained S3 = 2.83 ± 0.09. Our results translate via equation (19) 
(assuming n = —1.4) into a finite- volume corrected value S3 2.25 it 0.9, comparable to that of 
Kim &; Strauss (1988). To estimate the error bar in the skewness we assume that S3 corresponds 
to sum of Q over all triangles. The error is then roughly given by three times the width in the best 
determined direction in the ellipse in Figure 8, ±0.3. An approximate estimate of 5*3 that does not 
include the finite volume corrections can be obtained directly from Figure 8, by using the fact that 
at the 68% limit, Q/bi +62/61 is in the range 0.2-0.8; this implies ^3 « 3Qg « 1.5 + 0.9, consistent 
with Bouchet et al. (1993). These are only rough estimates, as they ignore the skewness of the 
bispectrum (and S3) PDF. 

Our results have interesting implications for the relative clustering of IRAS and optically 
selected galaxies. From the ratio of power spectrum amplitudes between the Stromlo-APM survey 
and the combined QD0T-1.2Jy survey, Tadros & Efstathiou (1996) conclude that 6f V^i^^'^ ~ 1-2, 
while from the analysis of the APM projected three-point function, Frieman &: Gaztaiiaga (1999) 
conclude that 6°^* ~ 1. These results are consistent with our findings for 6{^^'^. Future galaxy 
surveys will be able to quantify this issue with a greater level of accuracy. 

We thank Stephane Colombi, Enrique Gaztaiiaga, Lam Hui, Simon Prunet, and Matias Zal- 
darriaga for useful discussions. Research supported in part by the NSF EPSCoR Grant at Kansas 
and the University of Kansas GRF, by the DOE and NASA grant NAG 5-7092 at Fermilab, and by 
NASA NAG5-2835 at the University of Florida. Parts of this work took place at the Aspen Center 
for Physics. 
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Table 1. Maximum Likelihood Estimates of Bias Parameters 
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Fig. 1. — The dependence of Q on configuration shape expected from perturbation theory, for 
configurations with ^2/^1 = \ separated by angle 6, for a power spectrum P{k) ~ A;" with n = —1.4. 
The sohd curve shows the result in real space, the short-dashed curve in redshift space for Vtm = 0.3, 
and the long-dashed curve in redshift space for 0.^ = 1. For Vtm the dotted curve applies in 
both real and redshift space. The dot-dashed curve shows the effect of a local bias as in equation 
(16), with 1/61 = 1.25 and 62/6? = -0.5. 
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Fig. 2. — Q vs. k = ki for the QDOT catalog for triangles with two sides of a specified ratio 
separated by angle 9. The four panels show bands in 6, from nearly colinear, < 023/tt < 0.2 
(upper left), through 0.6 < ^23/'?'" < 0.7 (lower right). In each panel, Triangles show k2/ki = 0.25- 
0.5; squares show k2/ki = 0.5-0.75; circles show k2/ki = 0.75-1.0. The dashed lines in each panel 
show the median and 68% range of the points. 
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Fig. 3. — Same as in Fig. 2, for the 2 Jy catalog. 
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Fig. 4. — Same as in Fig. 2, for the 1.2 Jy catalog. 
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Fig. 5. — Q vs. 9 for the QDOT catalog for triangles with 0.05 <k< 0.2/iMpc~ and with two 
sides of ratio 0.4-0.6 separated by angle 9. The solid curve shows Q expected from perturbation 
theory for spectral index n = —1.4 in equation (8) for 17 = 0.3. The long-dashed curve shows Q in 
redshift space averaging many 2LPT synthetic realizations using the full LCDM power spectrum, 
showing the nonperturbative corrections discussed in §4.1. Symbols show results for bands in ki: 
triangles for ki = 0.1-0.125 /iMpc"^; squares, 0.125-0.15 /i Mpc"^ diamonds 0.15-0.175 ZiMpc"^; 
and circles 0.175-0.2 /iMpc"^ 
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Fig. 6. — Same as Fig. 5, for the 2 Jy catalog. Open symbols show raw results; filled symbols 
after correcting for the average finite volume bias. The short-dashed line shows the result expected 
for 1/6 = 1.32, 62/^^ = —0.57 applied to the 2LPT result. Note that the short-dashed line is not 
obtained by fitting directly to the filled symbols; since the PDF of Q is positively skewed low values 
of Q are more probable than high values. 




Fig. 7. — Same as Fig. 6, for the 1.2 Jy catalog. The short-dashed hne shows the result expected 
for 1/6 = 1.15, 62/52 = -0.50 apphed to the 2LPT result. 
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Fig. 8. — The bottom panel shows 68% and 90% hkehhood contours for bias parameters assuming 
Gaussian initial conditions and f^m = 0.3 for the 2 Jy and 1.2 Jy surveys. In the upper panels we 
show the marginalized likelihoods (normalization arbitrary) for 2 Jy (dashed) and 1.2 Jy (solid) 
catalogs. 
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Fig. 9. — Same as Fig. 8, but showing the dependence of the results on in the 1.2 Jy catalog. In 
the upper panels, we show the marginalized likelihoods for flm = 1 (dashed) and fim = 0.3 (solid). 
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Fig. 10. — Likelihood analysis for initial conditions in the 1.2 Jy survey. Contours for the 2 
highest S/N eigenmodes are 68%; contours for all eigenmodes are 68%, 90%, and 95%. Upper 
panels show marginalized likelihoods for 2 highest (dashed) and all (solid) eigenmodes. 




Fig. 11. — Likelihood analysis for the Lagrangian local bias model in the 2 Jy survey. Upper panels 
show marginalized likelihoods for Eulerian (dashed) and Lagrangian (solid) bias. 



